CDS

Accession Number TCMCG024C07581
gbkey CDS
Protein Id XP_021978238.1
Location join(107183268..107183424,107183516..107183598,107184203..107184297,107184441..107184544,107184639..107184808,107184897..107185011,107185082..107185137,107185226..107185264,107185379..107185477,107185585..107185647,107185721..107185783,107186307..107186389,107186471..107186579,107186661..107186714,107186854..107186941,107187464..107187501,107187578..107187683,107187777..107187943,107189582..107189641)
Gene LOC110873597
GeneID 110873597
Organism Helianthus annuus

Protein

Length 582aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022122546.2
Definition imidazole glycerol phosphate synthase hisHF, chloroplastic [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category E
Description Belongs to the HisA HisF family
KEGG_TC -
KEGG_Module M00026        [VIEW IN KEGG]
KEGG_Reaction R04558        [VIEW IN KEGG]
KEGG_rclass RC00010        [VIEW IN KEGG]
RC01190        [VIEW IN KEGG]
RC01943        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01663        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00340        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
ko01230        [VIEW IN KEGG]
map00340        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
map01230        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGATACGACGACGTTCAATTCTTCAAACTTCACCGCCGTTCCTGGTTCCCTAGGGTTACGAAACCGGCCACTGAAACTACACCGAACCGCCATTCATTTCAAATCTTTTACAAACTTCTCTGTTCGTGCATCCGCTGTTGGAGCCAGTGATTCCACTGTGACTCTGCTTGATTATGGTGCTGGTAATGTTCAGAGTATTAGAAATGCAATTCGTTATCTAGGTCTGGATATCAAAGATGTTCAAACACCAGAGGACATTTTGAATGCAAAACGTCTTATATTTCCTGGGGTTGGAGCTTTTGCTTCAATGATGGATGTTTTAAACAAGAACGGGATGGCTGAAGCTCTTCGGGCATACATCGAGAACGATCGTCCATTTTTAGGCATTTGTCTCGGTCTGCATTTACTCTTTGAGTCAAGTGAAGAAAACGGGCCTGTGAAAGGTCTAGGTTTGATTCCTGGGGTGGTTGGACGTTTCGATTCTTCTAATGGTTGCAGAGTGCCTCATATCGGTTGGAACGCTCTGCAAATTAAAAACGATTCAGTTATTTTGGATGAAATTGCAAATCGCCATGTGTATTTTGTTCATTCTTACCGAGCTGTTCCCTCAGAAGAAAATGAAGAGTGGATTTCATCCACCTGCAATTACGGGATTGACTTCATATCATCTATTAGAAGGGGAAATGTACACGCGGTTCAGTTTCACCCGGAGAAAAGTGGAGATGTTGGTCTTTCAATATTGCGAAAGTTCTTGTTGCCAAATTCATCCATAACTAAGAAGTCATTTGAAGGGAAGGCTACAAAGCTTGCAAAGAGGGTAATTGCTTGTTTGGATGTGAGAACAAATGATAATGGCGATCTTGTTGTTACAAAAGGTGATCAATATGATGTGAGAGAACAGACAAAAGAGAATGAGGTGAGGAACCTGGGTAAGCCAGTTGAACTTGCTGGACAGTATTACTTAGACGGTGCTGACGAGGTTAGCTTTTTGAATATCACCGGGTTTCGTGATTTCCCTCTGGGTGATTTGCCAATGTTGCAGATTTTGAGGTACACATCAGAGAATGTTTTTGTACCATTGACAGTTGGCGGTGGCATTCGAGATTTCACGGATGCGAATGGCAGGTATTATTCTAGTTTGGAAGTTGCTTCAGAGTATTTCAGATCTGGCGCAGATAAGATTTCTATCGGAAGTGATGCTGTTTATGCTGCAGAAGAGTATTTAAAAACAGGAGTAAAAACTGGTAAGAGCAGCTTAGAACAAATTTCCAGAGTCTACGGAAATCAGGCAGTGGTTGTAAGCATCGACCCTCGTAGACAATATTTGACCAGTCCTTATGAAGTTGGATTTAAATCAGTTAAAGTAACAAACTTGGGACCAAACGGTGAAGAGTATGCCTGGTATCAGTGTACAGTTAATGGTGGAAGAGAGGGTCGACCAATAGGAGCTTATGAGCTTGCAAAAGCCGTTGAAGAACTAGGAGCTGGAGAAATATTGCTCAACTGTATTGACTGTGATGGTCAAGGAAAGGGATTTGATATTGATCTGATAAAACTGATATCTGATGCTGTGAGCATTCCTGTAATTGCAAGTAGCGGTGCAGGAAAAGCCGAACATTTTTCCGAGGTTTTTTCACAAACTAATGCTTCTGCAGCTCTTGCTGCTGGGATTTTTCACAGGAAAGAGGTACCGATTCAGTCAGTAAAAGACCATTTGTTGAAGAAAGGCATTGAAGTAAGGATGTAG
Protein:  
MDTTTFNSSNFTAVPGSLGLRNRPLKLHRTAIHFKSFTNFSVRASAVGASDSTVTLLDYGAGNVQSIRNAIRYLGLDIKDVQTPEDILNAKRLIFPGVGAFASMMDVLNKNGMAEALRAYIENDRPFLGICLGLHLLFESSEENGPVKGLGLIPGVVGRFDSSNGCRVPHIGWNALQIKNDSVILDEIANRHVYFVHSYRAVPSEENEEWISSTCNYGIDFISSIRRGNVHAVQFHPEKSGDVGLSILRKFLLPNSSITKKSFEGKATKLAKRVIACLDVRTNDNGDLVVTKGDQYDVREQTKENEVRNLGKPVELAGQYYLDGADEVSFLNITGFRDFPLGDLPMLQILRYTSENVFVPLTVGGGIRDFTDANGRYYSSLEVASEYFRSGADKISIGSDAVYAAEEYLKTGVKTGKSSLEQISRVYGNQAVVVSIDPRRQYLTSPYEVGFKSVKVTNLGPNGEEYAWYQCTVNGGREGRPIGAYELAKAVEELGAGEILLNCIDCDGQGKGFDIDLIKLISDAVSIPVIASSGAGKAEHFSEVFSQTNASAALAAGIFHRKEVPIQSVKDHLLKKGIEVRM